AI032
Programming Massively Parallel Processors: A Hands-on Approach
Performance Analysis and SIMT Execution
Learning Objectives
- Evaluate the SIMT execution model's efficiency on parallel workloads
- Identify performance bottlenecks related to branch divergence and serialization
- Analyze memory latency hiding techniques within warp scheduling
- Calculate utilization and occupancy metrics for GPU kernels